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Simulations of partially coherent focal plane imaging 
arrays: Fisher matrix approach to performance evaluation 
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ABSTRACT 

Focal plane arrays of bolometers are increasingly employed in astronomy at far- 
infrared to millimetre wavelengths. The focal plane fields and the detectors are both 
partially coherent in these systems, but no account has previously been taken of the 
effect of partial coherence on array performance. In this paper, we use our recently 
developed coupled-mode theory of detection together with Fisher information matrix 
techniques from signal processing to characterize the behaviour of partially coherent 
imaging arrays. We investigate the effects of the size and coherence length of both 
the source and the detectors, and the packing density of the array, on the amount of 
information that can be extracted from observations with such arrays. 

Key words: instrumentation: detectors, methods: numerical, methods: statistical, 
techniques: image processing, infrared, submillimetre 



1 INTRODUCTION 

Large format focal plane imaging arrays are of increasing importance in many areas of submillimetre wave astronomy. Ar- 
rays utilis ing semiconducting bolometers have already been built for the Stratospheric Observatory For Infrared Astronomy 



rays utilising semiconducting bolometers nave already been built tor the stratospheric Ubservatory .tor Intrared Astronomy 
(SOFIA) jEvans et all 120021 ) and SHARC II l|Dowell et al.ll2003h. each of wh ich utilises 12 x 32 arrays. Fu ture arrays are 



planned using transition edge senso rs, including SCUB A 2 (|Holland et al.ll2006l ). GISMO dStaguhn et al 



the Green Bank T elescope (GBT) ( Dicker et ahll2006h . Atacama Cosmology Telescope (ACT) ( Fowler 
I Maffei et al.ll2005h . The largest of these planned arrays is of size 64 x 64 pixels 



2006) and arrays for 



20041 ) and CLOVER 



The construction of imaging arrays such as these, all of which operate at wavelengths between 0.05mm and 3mm, presents 
difficulties in understanding their behaviour. Crucially, no account has previously been taken of the effects of coherence on the 
performance of an array. This is important for two reasons. Firstly, at the wavelengths concerned telescopes are few moded, 
resulting in a partially coherent field in the focal plane, with correlations between different pixels. Secondly, the detectors 
used in the arrays are only a few wavelengths in size (in the case of ACT less than a wavelength), and should be expected 
therefore to show partially coherent behaviour themselves. If partially coherent imaging arrays are to be used successfully for 
astronomical observations, it is essential to understand their behaviour. 

In the pa st, understanding of partially coherent d etectors was limited by the lack of practical modelling techniques. In a 
recent paper ( Saklatvala. Withington fc Hobsonl 2007 ) we developed a general theory of partially coherent detection; in this 
paper we apply the technique to the simulation of imaging arrays. Qualitatively, coherence has two effects: it increases the 
noise associated with the statistical variation of the incoming field, and leads to correlations in this noise. We employ a Fisher 
matrix analysis to gain insight into how these effects alter array performance. In our simulations, we compare the ability of 
different arrays to recover information about some simple sources. Specifically, we focus on the size and coherence both of 
the detectors and the sources, and the packing density of the array. We anticipate that such analyses will prove useful in the 
design of the next generation of astronomical focal plane bolometric imaging arrays. 
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2 THEORY OF PARTIALLY COHERENT DETECTORS 



In earlier papers IjWithington fc Y assin 2001, 2002; ISaklatvala. Withington fc Hobsonll2007i ) we derived the following expres- 
sion for the average power absorbed by a paraxial multimode power detector from a statistically stationary source: 
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I duJ ^2Yl / dri / dr 2 Z* j (r 1 ,r2,Lj)Y i j(r 1 ,r 
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where Yij(n, r%, w) is the cross-spectral density 



Yij(ri,r 2 ,uj) = 



(Ei(ri,t)E*(r2,t + u))du . 



(1) 



(2) 



and Ei(ri,t) is the ith component of the analytic signal associated with the incoming electric field at position n and time t. S is 
a surface normal to the wavevector characterizing the input of the detector, and the sums are over the two polarizations normal 
to the wavevector. Zij (n, V2, w) is a quantity characterizing th e detector which we call the 'detector cohe rence tensor'. We have 
deriv ed HJ in several ways, using thermodynamic arguments ( Withington fe Yassinl 2001 ). reciprocity ( Withington fc Yassinl 
20021 ). and general arguments about the properties of detectors jSaklatvala et al.ll2007t ) The result is effectively a weighted 



linear sum of all pairs of space-time correlations in the incoming field, and may be interpreted also in terms of the coupling 
between the natural modes of the incoming field and a set of modes that characterizes the detector. The power of our theory 
is that, in addition to calculating the average detector outputs, it also allows us to relate the statistics of the incoming fields 
to the statistics of the detector outputs. For a thermal source, for an integration time much longer than either the coherence 
time of the source or the response time of the detector, the covariance of the power recorded by two detectors is 
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(r 1 ,r 2 ,uj)Yij(r 1 ,r2,LL>) 



(3) 



where r is the integration time. (ED can be derived using e ither a semi-classical (jSaklatvala et al.l 120071 ) or quantum optical 
I Zmuidzinaj 20031 ; Withington. Hobson fc Saklatvala 2005 ) approach. ([3]) gives both the fluctuatio ns of the detector outputs 
and t he correlations between the outputs, and thus incorporates the Hanbury Brown-Twiss effect ( Hanburv Brown fc Twissl 
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In most applications, a frequency filter is placed in front of a detector to restrict the bandwidth (this is necessary to 
avoid blurring of the image, in the case of an array, or ambiguity in the baselines, in the case of an interferometer). As an 
idealization, we consider a filter that multiplies the cross-spectral density by a top hat function with central frequency wo and 
bandwidth Au>. To avoid blurring, the bandwidth must be sufficiently small that the cross-spectral density and the detector 
coherence tensor are approximately constant across the bandwidth of the detector, in which case 



(P) ~ A^^^ / dri / dr 2 Zi j (r 1 ,r 2 ,<Jo)Yi j (ri,r2 S 
i—i .-1 Js Js 
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For numerical work we must discretize the integrals in Q and ([5]). The idealized detectors we simulate in this paper respond 
equally to all polarizations, so defining 

Z a/3 = (Aa;) 2 Z ij (r a ,r /3 ,a;o) (6) 

(7) 

(8) 
(9) 



Y Qf3 = ^2,Yij{r a ,rp, 
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for some regular grid of sample points {r a } with spacing Ax, we obtain 
(P) ~ AwTrZY , 
and 

Cov[P a ,P b ] ~ — ^TrZ a YZ b Y + 5 a6 ^ TrZ a Y 
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Note that we have used italic letters to denote the continuous tensors and Roman letters to denote the discretized forms. 



In th e single mod e case, the signal-to-noise ratio P/y Cov[P, P] obtained from (JSj and ([9]) reduces to the Dicke radiometer 
equation ( Dickell946h in the high occupancy limit, and Poisson noise in the low occupancy limit. In this paper we are interested 
in the long wavelength, high occupancy limit, so we neglect the second term of 



3 PARAMETER ESTIMATION FROM IMAGING ARRAYS 



A crucial factor in measuring the performance of an imaging array is the amount of information that can be extracted 
from it. To this end, we employ the technique of Fisher matrix analysis, which is widely used in many areas of a stronomy 
I Tegmark. Taylor fc Heavens! 1 19971 ; iHamiltonl [l997l ; lYamamoto et al-lkoOll ; iMartins et allbooi iRocha et al"1l2004 ), particu- 
larly the constraint of cosmological parameters from CMB measurements, and has also been applied to the design of radio 
interferometry arrays IjStoica fc Nehorail ll98Sl: Ubramovich et al.lll996l ; Udorj I1996T ). but has not hitherto to our knowledge 
been employed in the context of focal plane imaging arrays. The Fisher matrix analysis technique provides a way of estimating 
the uncertainties and correlations in the parameters of a model fitted to some data, averaged over all realisations of the data, 
and without the need to enumerate specific realisations. In this paper we perform simulations to estimate the constraints on 
the parameters of a source when observed with imaging arrays of different size, coherence and packing density. 

Given a data column vector D and a model TL with parameter column vector 0, the Fisher information matrix is equal 
to minus the expected curvature of the logarithm of the likelihood: 



F l3 [0,H] = -E 



8 2 



86,89 



■lnPr(D|0,ft) 



(10) 



For a uniform prior, the posterior is proportional to the likelihood, so the Fisher matrix is also equal to minus the expected 
curvature of the logarithm of the posterior. Applying the Gaussian approximation to the posterior, we see that the covariance 
matrix of the recovered parameters can b e estimated by i nverting the Fisher matrix for the parameter values at the maximum 
of the posterior. Indeed it can be shown ( Marzetta 19931 . and see Appendices [5] and [Bj that the inverse of the Fisher matrix 
provides a lower bound for the covariance matrix of any unbiased estimators of the parameters (the Cramer-Rao bound). 

The Fisher matrix at the maximum of the posterior can itself be estimated from the mean and covariance of the data, 
and the derivatives of these quantities with respect to the parameters. Applying the Gaussian approximation to the likelihood 
function, we can show (see Appendix [Cj that 
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n[0,H])(D-n[0,H]) 
= 0o, satisfying 

Pr(D\0,H) = 0. 

J e=e 

To model an imaging array, we use the results (|8} and ((9} and set 



Px{D\0,H){D - n\0,H\){D -n[0,H]ydD 



are evaluated at 
E 9 



Di = <P l > 



Cov[P\P J ] , 



(11) 



(12) 



(13) 



(14) 



(15) 
(16) 



where P l is the output of the ith detector. With this substitution it can be seen that the first term of (|lip is proportional to 
tAcj, and therefore proportional to the ratio of integration time to coherence time; whereas the second term is independent 
of this ratio. For <(9j to hold the integration time must be much greater than the coherence time, and hence we can ignore 
the second term of (lll|) . We see that the covariance of the recovered parameters scales with the reciprocal of the number of 
coherence times per integration time. As the ratio of integration time to coherence time tends to infinity, the uncertainty in 
the parameters therefore tends to zero, but in reality, the integration time is limited by practical factors. 

Note that although the final result is the same as in previous astronomical applications, the physics is significantly 
different: the fluctuations and correlations depend on the parameters of the model, whereas in previous work the noise terms 
have been independent of the parameters. It is clearly possible to add parameter independent noise terms to (|9]), but since 
the purpose of this paper is to demonstrate the effect of coherence on imaging arrays, we ignore these additional terms. 
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4 PARAMETRIZATION OF SOURCES AND DETECTORS 

In this paper we perform simulations to investigate the ability of different imaging arrays to extract information from a source. 
We therefore need suitable parametrizations for both the sources and the detectors. 



4.1 Parametrization of sources 



The performance of an imaging array will clearly depend on the nature of the sources being observed; there is no universal 
measure of absolute performance. We can however gain considerable insight by investigating the abilit y of imaging arrays w ith 
different coherence lengths and geometries to determine the parameters of a Gaussian-Schell source ( Collett fc Woli 19781 ): 



*<"-> " ^(- lB -^- fJ -^) ■ («) 

where r c = (x c ,y c ) is the position of the source, / is the total power, a s is the geometrical width and a g is the coherence 
length. There are several advantages to using such a source in our simulations. Firstly, it is often a good approximation to the 
coherence function in the focal plane of a telescope when either a point source or incoherent Gaussian source is present on 
the sky. Secondly, there is a clear modal interpretation: the natural modes of a Gaussian-Schell source are the Gauss-Hermite 
modes. Thirdly, there are sufficiently few parameters that the Fisher matrices generated can be easily interpreted. There is no 
explicit reference to wavelength in the Gaussian-Schell parametrization, but in practise the wavelength dependence is implicit 
in the coherence length and geometrical size of the source. 
In the simulations we use the parameter vector 

= (x c ,y c ,I,a s f . (18) 

The coherence length a g is not included as a parameter to be recovered: our simulations verified that it is highly degenerate 
with the intensity, and thus leads to a poorly conditioned Fisher matrix. As the coherence length of the incoming field in the 
focal plane depends primarily on the telescope optics and is therefore usually known, whereas the intensity of the source is a 
parameter to be determined, we include the intensity but not the coherence length in our parameter vector. 

In Section [5.31 we will consider a slightly more complicated source, namely a pair of identical Gaussian-Schell sources. 
If the sources are not correlated with each other, the coherence tensors can just be added, hence the overall source coherence 
tensor has the form 



Y(n,r 2 ) = j= _ exp 



|ri - r 2 \ 2 



x ^ exp f \ri - r c - r scp /2| 2 + \r 2 - r c - r scp /2| 2 ^ + ^ / |n - r c + r scp /2\ 2 + \r 2 - r c + »WgH 

where r c = (x c ,y c ) is the position of the midpoint between the sources. r scp = (x aep , y se p) is the separation vector between 
the sources. We define the parameter vector for the double Gaussian-Schell source as 

6 = (x c , y c ,x SC p,y scp ,I,a s ) T . (20) 
In order that the posterior is unimodal (and hence that the Gaussian approximation can be applied) , we impose the requirement 



4.2 Parameterization of bolometric detectors 

In general the coherence tensor of a detector will depend on the details of the physics. For the purpose of this paper however 
we require a general parametrization that is a reasonable approximation to a typical detector, and that incorporates the most 
important features. Therefore we consider square pixels of uniform absorbtivity, and with a Gaussian correlation function. 
Such a model is partially coherent and hence multimode. Thus the detector coherence tensor of the ath detector centred at 
(x a , y a ) is given by 

Zi j (r 1 ,r 2 ) = Q(x! - (x a - s/2))e((x a + s/2) - x 1 )Q{x 2 - (x a - s/2))e((x a + s/2) - x 2 ) 

e(yi - {y a - s/2))@({y a + s/2) - yi )Q{y 2 - (y a - s/2))@((y a + s/2) - y 2 ) exp (-tl^p^j , (21) 

where s is the detector size, c is the detector coherence length, r n = (x„,y n ), and Q(x) is the Heaviside step function. 
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Figure 1. Estimated 80% marginalized confidence contours for the recovered parameters of a Gaussian— Schell source, observed with 



detectors have coherence length c = 5, and integration time r = 100/Acj. On the top row (a-c), the detector are of side length 15 units, 
and are thus close packed, while on the bottom row (d-f) they are of side length 3 units, and are therefore sparse. The source is at the 
centre of the array (x c = 0) in the plots on the left (a, d), while the plots in the middle (c, e) and on the right (d, f) have x c = 7.5 and 
x c = 15 respectively. 



5 SIMULATIONS 

We perform all simulations on a uniform 51 x 51 grid. We use (J6]l and ((TJ to perform the discretization, (|15[) . (|16[) . p| and 
© to calculate the data vector and covariance matrix, and (|1 1[) to calculate the Fisher matrix. In the parametrizations for 
source and detector that follow, we measure all lengths in number of grid points; our results can straightforwardly be scaled 
to physical units. We set the ratio of integration time to coherence time to be 100. Numerical differencing with a step size of 
0.001 was used to calculate the derivatives dDk/d8i. We verified, by using polynomial interpolation techniques to calculate 
the derivatives to machine precision in a few random cases, that numerical differencing gives more than sufficient accuracy 
for the purpose, as well as being much faster. Inversion of the covariance matrix is problematic due to its poor conditioning. 
To m inimise numerical errors and make the inversion stable we therefore employ the pseudo-inverse (|Moorelll920l : I Penrose! 



1955), discarding singular values below one millionth of the largest singular value. This cut-off is arbitrary, but we found our 



results were stable through several orders of magnitude of cut-off. Physically, we can justify the use of the pseudo-invers e 



from the fact that the signal-to-noise ratio falls off rapidly at the edge of the source ( Saklatvala. Withington fc Hob son 2006), 
and hence these detectors do not contribute significantly to the Fisher matrix. The effect of taking the pseudo-inverse of the 
covariance matrix is to discard these same detectors; hence the result is almost identical to that which would be obtained 
using the true inverse on a machine of unlimited precision. 



5.1 Parameter recovery for a Gaussian— Schell source 

Figure [T] shows the 80% marginalized confidence contours for the recovered parameters of a Gaussian-Schell source, estimated 
using the Fisher matrix, when the source is observed with close packed and sparse arrays (see figure caption for details). We 
observe that all parameters are better constrained with the close packed array, as expected. In all the plots shown, the width 
of the source is more tightly constrained than its position; this can be predicted from the fact that the source is constrained 
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Figure 2. Estimated uncertainties on the recovered parameters (x c and y c have the same uncertainties), and the Mth root (where M 
is the number of free parameters in the model, equal to 4), of the estimated allowed volume of parameter space, for a Gaussian-Schell 
source with parameters x c = y c = 0, / = 1, cr s = 10 and <r 9 = 5, observed with arrays of detectors of various size and coherence length 
(c). In the first row of plots (a), the detectors are close packed, with one detector centered at (0, 0), and a minimum total size of 51 X 51 
units. In the second row (b), the array contains 5x5 detectors centered 15 units apart, such that the array becomes increasingly sparse 
as the detector size is reduced. 



in the model to be circularly symmetric, and thus changing a s affects the extent of the source in both x and y directions, and 
thus affects the output of more detectors than a change in the x or y coordinate alone. When the source is at the centre of the 
array, the parameters are uncorrelated except the width and intensity, which are anti-correlated. In some of our subsequent 
simulations the width and intensity are positively correlated. A common theme of this paper is that while our simulations 
produce all the results that could be straightforwardly predicted, they also uncover many other effects that might not have 
been expected. 

Figure [1] also shows the effect of moving a source away from the centre. We choose the displacements so that the source 
is centered midway between detectors (centre), and centered on a detector adjacent to the central one (right). We note that 
there is a minimal difference when the source is centred between detectors, even for the sparse array, where the gaps are 12 
units, and thus greater than a s . This suggests that as long as neither the gaps between the detectors nor the detector spacing 
are much larger than a s , the position of the source does not affect the amount of information that can be recovered. The 
implication is that the shoulders of the Gaussian intensity distribution are the important features, not the central peak. Our 
simulations show on the other hand that losing part of the source off the edge of the array has a dramatic effect on the estimate 
of the displacement, though less so on the other parameters. It also introduces a correlation between the displacement and 
the width. When the source is centered on a detector adjacent to the central one, it is centered roughly la s from the edge of 
the array, and these effects are seen clearly. 



5.2 Effect of detector size and coherence on close— packed and sparse arrays 

We have seen that sparse arrays of small detectors are less effective at recovering source parameters than close packed arrays 
of large detectors, but now we investigate in detail the effects of both detector size and coherence, and distinguish between 
the effect of detector size and the effect of the number of detectors in the array. 

Figure [2] shows the estimated uncertainty on the recovered parameters of the same Gaussian-Schell source as in Section 
15.11 with close packed and sparse arrays of detectors of various size and coherence (see figure caption for details). We also 
show the quantity \F~2m\, where F is the Fisher matrix and M is the number of parameters in the model. This quantity 
can be interpreted as the Mth root of the estimated allowed volume of parameter space, and gives a measure of the overall 
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Figure 3. Estimated uncertainties on the recovered parameters, and the Mth root of the estimated allowed volume of parameter space, 
for a Gaussian-Schell source observed with close packed and sparse arrays of detectors of various size and coherence length. The sources 



and arrays are identical to those in Figure [2] except that the coherence length of the source is increased such that a g 
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Figure 4. Estimated uncertainties on the recovered parameters, and the Mth root of the estimated allowed volume of parameter space, 
for a Gaussian-Schell source observed with close packed and sparse arrays of detectors of various size and coherence length. The sources 
and arrays are identical to those in Figure [2] except that the source is incoherent (a g = 0). 
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Figure 5. Estimated uncertainties on the recovered parameters, and the Mth root of the estimated allowed volume of parameter space, 
for a Gaussian-Schell source observed with close packed and sparse arrays of detectors of various size and coherence length. The sources 
and arrays are identical to those in Figure [2] except that the geometric size of the source is reduced such that cr s = 5. 



uncertainty in the measurement. If the parameters are uncorrelated, it will be equal to the geometric mean of the uncertainties 
on the individual parameters, but if they are correlated it will be less. 

We can identify many trends. The more coherent the detector the less accurately the source parameters can be determined, 
as would be expected from our detector model, in which more coherent detectors have inferior signal to noise. Furthermore, 
this effect becomes more significant as the detector size increases; again this is to be expected, as the coherence makes little 
difference to a small detector. A final general comment is that the uncertainty in the intensity follows smooth trends for both 
close packed and sparse arrays, while the behaviour of the position parameters, which is more complicated, is always the same 
as that of the width. 

For the close packed arrays, the uncertainty of the parameter estimates generally increases with detector size. However, 
the detailed behaviour is much more complicated. The incoherent detectors give almost identical accuracy regardless of the 
size. The uncertainty on the intensity increases progressively with detector size whereas the uncertainties on the position and 
width rises and falls twice, but with an underlying increasing trend. The tendency to less accurate parameter recovery with 
increasing size can be attributed only in part to reduced spatial resolution, as the effect is much less for incoherent detectors. 
Rather the trends seen appear to be caused by the details of the coupling of the natural modes of the detector to the natural 
modes of the source. 

In the case of the sparse arrays with fixed number of pixels, we find in all cases that the uncertainty in the intensity falls 
gradually as the detector size increases, while there is a step change in the other parameters at a size of around 11 units. The 
detailed behaviour depends on the coherence length; for the coherent detectors, the uncertainty actually increases with size 
up to a threshold, then falls, whereas for the less coherent detectors the uncertainty falls gradually up to the threshold, then 
suddenly, before plateauing. Furthermore, the precise threshold size depends on the coherence length. 

We now investigate how the size and coherence length of the source changes our results. Figure [3] shows the same 
information as Figure [2] except that the coherence length of the source is increased fourfold. We see a number of changes 
in the behaviour. The effect of the coherence of the detector is now generally much less pronounced. The uncertainty on the 
intensity is almost independent of detector size. The uncertainty on the position and width for the close packed arrays show 
the same complicated behaviour as before, but the peaks are shifted. For the sparse arrays, we still see the step change at 
around 11 units, but almost no variation below that threshold. The coherence length of the detector does affect the position 
of this transition, hence when the size is around 11 units the detector coherence length does make a significant difference. 
Finally we see that the uncertainties on all parameters are greater than for the less coherent source, as expected, because the 
signal to noise is worse. 
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Figure 6. Estimated 80% marginalized confidence contours for the recovered parameters of a pair of identical Gaussian-Schell sources, 
observed with 5x5 arrays of detectors centered 15 units apart. In all cases the source has parameters x c = y c = 0, y S ep = 0, J = 1, 
u s = 10 and a g = 5, while the detectors have coherence length c = 5. On the top row (a— c), the detector are of side 15 units, and are 
thus close packed, while on the bottom row (d-f) they are of side 3 units, and are therefore sparse. The sources are separated in both 
the x and y directions with x sep = y aep = 10 (a, d), x aep = y aep = 15 (b, e) and x aep = y aep = 20 (c, f). 



Figure [4] shows the same information as Figure [2] except that the source is incoherent. The opposite effects are seen as in 
Figure [3] the detector coherence length is much more important, the step change for the sparse arrays is still present, though 
less obviously for the incoherent detector, whereas the other trends are more pronounced. The peaks and troughs for the 
position and uncertainty with the close packed arrays are shifted yet again, though not predictably. The uncertainties on all 
parameters are less than for the partially coherent sources, as expected. 

Figure [5] shows the same information as Figure [2] except that the size of the source is halved. Most of the trends are very 
similar, but the obvious difference is the sharp peak in the uncertainty of the position and width estimates for the close packed 
array with size 11 units. There is also some slight discrepancy in behaviour between the position and width estimates, which 
was not seen in Figure [2] More telling however is the fact that the step change for the sparse arrays is not shifted either by 
changing the size or coherence of the source. It is not therefore simply caused by matching the size of the detectors to either 
size or coherence length of the source; rather the most important factor seems to be the fraction of the focal plane covered 
by the detectors. Finally, the errors on the position and size are less than in Figure [2] though the error on the intensity is 
greater. 

In this section, we have seen that apart from a few predictable trends, most of the behaviour of imaging arrays is quite 
complicated and not possible to predict by straightforward physical arguments. The complexity appears to be due to the 
details of how the natural modes of the source (in this case the Gauss-Hermite modes) couple to the modes of the detectors. 
Indeed, when the length scales of source and detectors are similar, and the sources and detectors are partially coherent, there is 
no reason to expect simple behaviour. Our simulations have demonstrated the importance of performing detailed simulations 
whenever partially coherent arrays are to be designed. Nevertheless, there are a number of key trends to emerge. Firstly the 
coherence of both detector and source have a significant effect on the parameter recovery. Secondly, the packing density needs 
to be above a certain threshold for information to be recovered. This packing density is roughly independent of the source 
parameters, and corresponds to roughly 50% of the focal plane being filled. 
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Figure 7. Estimated 80% marginalized confidence contours for the recovered parameters of a Gaussian— Schcll source, observed with 
17 X 17 close packed arrays of detectors of size 3 units and coherence 10 units, but with only 8x8 channels, so each channel is attached 
to 4 detectors. In the top row (a-c), 2x2 clusters of adjacent detectors are joined to the same channel, while in the bottom row (d— f) 
the detectors on each channel are interlaced, such that there is one interposing detector between detectors on the same channel. In all 
cases the source has parameters x c = 0, I = 1, a s = 20. The source has coherence length a g = (a, d), rj g = 10 (b, e) and <r g = 30 (c, f). 

5.3 Parameter recovery for a double Gaussian— Schell source 

We now consider a slightly more complicated source: the double Gaussian-Schell source denned by (119[) . Figure [6] shows 
the 80% confidence contours of the parameters of such sources with various separation vectors, when observed with both 
close-packed and sparse arrays. As the sources are moved apart in the x direction, x c becomes better constrained, but the 
other parameters become less well constrained, due to part of the source moving off the edge of the array. The same effect is 
seen for both close packed and sparse arrays. The source parameters can be equally well inferred even when the sources are 
strongly overlapping. However, this is not to say that the sources can be resolved: in order to determine whether two sources 
are resolved, it is necessary to compare models, for instance a pair of identical circularly symmetric sources with a single 
elliptical source. Fisher matrix analysis is not sufficient for this purpose; however, our coupled mode theory of detectors allows 
the higher moments, and hence the full likelihood function of the detector outputs to be calculated, and thus, in principle, 
it enables a detailed Bayesian analysis to be carried out. For the purpose of this paper, we simply consider Figure [6] to be a 
demonstration of how the Fisher matrix technique can be applied to complex sources of known form. 

5.4 Reduced channel arrays: effect of interlacing detectors on the different channels 

In a real imaging array, there is a multiplexing problem, and so it is often desirable to make the number of channels smaller 
than the number of detectors, by connecting multiple detectors to the same channel, such that the output of the channel is 
the sum of the outputs of the individual detectors. The question then arises: given a certain number of detectors and a certain 
number of channels, what is the best way to group the detectors feeding into each channel? The most obvious configuration 
is to have contiguous blocks of detectors on the same channel, but the complexity of behaviour we have seen in our previous 
simulations suggests that in some circumstances, it may be desirable to interlace the detectors on the different channels. 

We consider a close packed 17 x 17 array of detectors of size 3 units and coherence length 10 units, but with 4 detectors 
attached to each channel. In the top row of Figure [7| we show the recovered parameters of sources of different coherence 
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lengths where the 4 detectors on each channel form a 2 x 2 contiguous block. In the bottom row, the detectors on the different 
channels are interlaced such that the 4 detectors on each channel are at the corners of a 3 x 3 block. 

For the incoherent source, all the parameters are less well constrained for the interlaced channels than the contiguous 
ones; for the partially coherent source (<r 9 = 10), the parameters are better constrained for the interlaced channels, and for 
the fully coherent source (a g = 30), the position and width are worse constrained, but the intensity is better constrained, for 
the interlaced array. The effects are quite small because the arrays remain close packed, and the "effective pixel size" varies 
only from 6 to 9 units; Figures [2] to [5] would suggest a relatively small effect over this range. However, it is possible that 
with many detectors on each channel, and with the detectors on each channel spread over a wider area, these effects may be 
amplified. The complexity introduced makes further exploration beyond the scope of this paper; however it is a potentially 
important area for future work. 



6 CONCLUSION 

We have combined our coupled-mode theory of detection with Fisher information techniques from signal processing to study 
the performance of imaging arrays. Several of our results have important implications for array design. We have found that 
there is a threshold packing density above which the detectors must be packed for information to be recovered, and that this 
threshold has little dependence on the size and coherence of the source. Above and below this threshold, the packing density 
makes relatively little difference. As long as the source is not much smaller than the detector spacing, the same information 
can be obtained regardless of its position, even for sparse arrays. On the other hand, the coherence length of both source and 
detectors makes a significant difference to the amount of information that can be obtained. Most importantly of all, we have 
shown that detailed simulations are necessary for understanding the behaviour of imaging arrays. The reason for this is that 
the behaviour results from the coupling between the natural modes of the array and the natural modes of the source; when the 
source and detectors are multimode, the behaviour thus becomes too complicated to be explained by straightforward physical 
arguments. 

Numerous extensions to our work are possible. One could study the effects of photon occupancy for instance, or use a 
more sophisticated model for the detectors. One could consider more complicated array geometries, and explore the issue of 
interlaced pixels in more detail: our results in this paper are inconclusive, but enough to suggest that interlaced pixels may be 
preferable in certain conditions. Perhaps most interestingly, one could investigate whether putting the outputs of the detectors 
through a correlator enables more information to be recovered, and can be used to mitigate the poor signal to noise ratios for 
detectors placed in highly coherent fields. 



APPENDIX A: ALTERNATIVE DEFINITION OF THE FISHER MATRIX 



The definition of the F isher matrix in (|10[) is that used by most astronomers; however there is an equivalent definition 
( Kendall fc Stuartlll967l ). that is necessary to derive the Cramer-Rao bound. Define the score vector V[0,W] by 



Vi[e,n] = ^inPr(D\e,H) . 



(Al) 



We now show that the Fisher matrix is equivalent to the covariance of the score. From the definition of the score in (|A1[) we 
can show trivially that 



E[Vi] = , 

and therefore the covariance is given by 

~dlnPr{D\0,H) dlnPr(D\0,H) 



(A2) 



Cov[Vi[0,?4yi[0,w]] = e 



Pr(D|0,H)^4^^4^dD 



/ 



oe t do, 

dPr(D\0,H) dPr{D\0,H) 



Px{D\0,n) de, 

The Fisher matrix, as defined in ([10}, is given by 

d 2 \nPx(D\6 ,H) 



dD 



(A3) 



F %J [6,H] = -J Pr(D\6,H)- 



-dD 



d 2 Pr(D\6,H) 



dD 



dPr(D\e,H) dPx(D\e,n) 



Pr(D\e,H) 



dD 



(A4) 
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But 

32 



(A5) 



so by comparing (|A4|I with (IA3[) we find that 

K 3 [6>,H] = Cov[Fi[6>,W],yi[0,W]] , (A6) 
which is the alternative definition of the Fisher matrix. 

APPENDIX B: DERIVATION OF THE MULTI-PARAMETER CRAMER-RAO BOUND 

Consider a statistic vector T = t(D), that is an unbiased estimator of the parameter vector 0, i.e. 

E[T] = 6 . (Bl) 
Let C be the covariance matrix of the estimators: 

C = E[(T - 0)(T - 0) T ] (B2) 
Now, using the definition of the score in (IA1|I , we have 

E\ViTi] = J Ti^Pr(D|0)d£> 

= &j • (B3) 

Now form the matrix E[(T — 6 — WV)(T — Q — WV) T , where W is some arbitrary matrix. By forming a quadratic form 
with some arbitrary vector x, it is clear that 

E[(T - - WV)(T -0- WV) T ] > , (B4) 

i.e. the matrix on the left of the inequality is positive semi-definite. Expanding the left hand side of (|B4[I . using (|A6|I . (|A2[) 
and (|B3|) . and rearranging, we obtain 

C > W + W T - WFW T . (B5) 

Each matrix W gives us a different lower bound: to find the maximum lower bound we diagonalize the Fisher matrix 
F = QAQ T . It can then be easily shown that 

W + W T - WFW T = QAQ T - (WQ - QAT 1 )K{WQ - QAT 1 ) 1 . (B6) 
Hence the maximum lower bound is obtained for W = F . Substituting back in (|B5|) we obtain 

C > F' 1 . (B7) 

APPENDIX C: FISHER MATRIX FOR GAUSSIAN LIKELIHOOD 

The Gaussian approximation to the likelihood function is 

Pr(Z?|0 ' H) = (2^)M/2js[0, H ]|V2 exp (-\(D - ^M) T V-\0,H]{D - »[0,H\)) , (CI) 

where M is the number of parameters in the model. Henceforth, for brevity, we do not show the dependence of fi and £ on 
the model and its parameters explicitly in our notation. We can thus write 

lnPr(£»j6>,H) = — i(D — /z) T £ _1 (.D — fi) — TV/2 ln(27r) — — In £ , (C2) 
dlnPi(D\0,H) a M T v -i, n l gjnjgj l, n > T 9S->, n s , n „s 

— m — = ^r s v-ti-2-mr-iP>-rt-Mr { P-ti (C3) 

From (|10[) the Fisher matrix is given by 
'a 2 lnPr(£»|0,H)" 



Fa \0,H] = -E 



''""3 



_ i a 2 i n |s| i ( em- 1 *. 



a^ 2 a^a^ 2 v s^. 
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For the parameter values that maximize the likelihood, we set the expectation of the first derivative equal to zero, to 
obtain 

and thus 

*™-3£"-'f^M3r£)- 

Finally we can write 



aft y v^j 9ft 

- 2Tr feleTj + Tr (v s dft S aej 

i.^'^^). (07) 



and thus we obtain (fTTj) . 
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